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Abstract — For the purpose of adding to the time or space effi- 
ciency, selective enforcement of strictness is commonly practiced 
in today's lazy programming. Although it plays a key role in 
equational reasoning about programs, many few studies have 
considered observational equivalence between lazy programs in 
presence of selective strictness (OELPPSS). 

Gabbay et al. were first to consider OELPPSS and Haeri later 
completed their work. Both Gabbay et al. and Haeri build on 
a variation of the operational semantics of van Eekelen and de 
Mol which, in return, extends Launchbury's semantics for lazy 
evaluation to selective strictness. Gabbay et al. and Haeri choose 
to manipulate the operational semantics of van Eekelen and de 
Mol to prevent increase in heap expressiveness upon expression 
evaluation. This improvement helped them to prove their desired 
observational equivalences using a novel proof technique called: 
induction on the number of manipulated bindings (INMB). They 
used INMB to prove a handful of interesting results including a 
couple of observational equivalences. However, their operational 
semantics suffers from restrictions in expressiveness. 

In this paper, we present yet another variation of van Eekelen 
and de Mol. Our operational semantics is as expressive as that 
of van Eekelen and de Mol. We prove that INMB is valid for our 
operational semantics too. Therefore, all the interesting results 
of Gabbay et al. and Haeri including their observational equiv- 
alences remain valid for our system as well. This is whilst, like 
that of Gabbay et al. and Haeri, our operational semantics avoids 
increase in heap expressiveness upon expression evaluation. 

I. Introduction 

The first operational semantics for lazy evaluation that cor- 
rectly models sharing is that of Launchbury [ ], which is based 
on the simple notion of heaps, van Eekelen and de Mol [2] 
later extend Launchbury's semantics to model lazy evaluation 
in presence of selective strictness. Gabbay et al. [3] and Haeri 
[4] are the first to consider observational equivalence between 
lazy programs in presence of selective strictness. However, 
some reasonable lazy programs such as let y = Xx.x in Xx.y 
and let x' = Xx.x in (let {xi = Xx.x,x% = Xx.Xi} in x 2 ) x' 
do not reduce in their system. I.e., the results of [ , ] work 
for a restricted model of lazy evaluation. 

The novel technique of induction on the number of ma- 
nipulated bindings was first introduced by [ ] and [4]. The 
key results which validate this technique are three theorems 
on atomic variables. (Atomic variables can be thought of 
as the units of change in lazy evaluation.) The first two 
results tell us how to obtain a new (valid) derivation by 
removal/restoration of atomic variables in a derivation. The 



third states that every non-trivial lazy computation evaluates 
at least one atomic variable. In this paper, we prove these three 
theorems (Theorems IV.9, IV. 10, and IV. 11, respectively) for 
our own operational semantics. As a result, the observational 
equivalences proved in [ $,4] are valid in our system too. 

The main contributions of our paper are as follows: We 
provide a new operational semantics for lazy evaluation in 
presence of selective strictness. Like [3,4], we choose to 
garbage collect let-bindings. Thus, our operational semantics 
— unlike its predecessors [ , ] — does not suffer from the in- 
crease of heap expressiveness upon let-expression evaluation. 
On the other hand, unlike [3,4], our garbage-collecting (let) 
rule, retains the (possibly manipulated) let-bindings as a part 
of the returning value. Hence, as stated by Theorem VI. 2, our 
operational semantics is as expressive as that of van Eekelen 
and de Mol [2]. In other words, unlike its antecedents [3,4], 
our operational semantics does not suffer from restriction in 
expressiveness. We show the usefulness of our operational 
semantics by proving the validity of INMB for it. We provide 
the first precise formulation of the INMB principle. 

This paper is structured as follows: We start in §11 by 
presenting our syntax and operational semantics. §111 explores 
some interplays between heaps and derivation trees. In §IV, 
we study atomic variables and prove our main results which 
validate INMB for our system. INMB is outlined in §V. To 
demonstrate the use of INMB, §V provides an example proof 
using it and also mentions a couple of interesting results 
proved in [3,4] using this new inductive principle. Out litera- 
ture review comes in §VI. Finally, §VII concludes this paper 
and explores future work. 

Whilst only proof sketches are provided here, proofs which 
differ between our system and [3,4] come in [ ]. We do not 
reproduce the similar proofs of [ ]. Proof of Theorem VI. 2 
comes in [ ']. We inherit syntax, definitions, and some expla- 
nations from [3,4]. 

II. Syntax and Operational Semantics 

We start by presenting some syntax (Definition II. 1) to 
serve our big step operational semantics (Definition II. 7). Like 
[2], our operational semantics extends that of Launchbury for 
selective strictness. 

Definition 11.1. Fix a countably infinite set of variable sym- 
bols, x, y, z, . . . and X\, x%, X3, . . . will range over variable 



symbols. Define expressions and values by 

e ::= x \ Xx.e [ ex ] let {xj=ej}" =1 in e | e seq e 
« ::= let {xj=ej}" =1 in Ax.e n > 

e, e', ei, ... will range over expressions, v, v', v\, ... will range 
over values. \et {x i =e i }^ =1 in Xx.e when n = is a syntactic 
sugar for Xx.e, i.e., when there is no \et-bindings. We inherit 
Launchbury 's [ ] standard restriction in that functions can 
only be applied to variables. As stated in [ ], this does not 
reduce expressiveness because we also have let. 

Unlike Launchbury [ ], we quotient syntax up to a- 
equivalence of X- and \et-bound variables. For example, 
Xx.x = Xy.y. The reasons behind this design choice are 
discussed in full detail in [3]— [5]. In particular, this design 
choice dismisses the need for local freshness check in [7]. 

Remark II.2. Call let{xi=ej}™ =1 in Xx.e when n> 1 let- 
surrounded abstractions. In considering let-surrounded ab- 
stractions values, we follow Ariola and Felleisen [ ]. Here- 
after, unless stated otherwise for let {xj=ej}™ =1 in Xx.e we 
assume that n > 1. 

Definition II.3. Define fv(e) the free variables of e by: 

fv(x) = {x} fv(Xx.e) = fv(e) \ {x} 

fv(ex) = fv(e) U {x} 

Mlet {x i=e ,}f =1 in e) = (fv(e) U \J? =1 He,)) \ { Xi }? =1 

fv(e' seq e) = fv(e') U fv(e). 

See Remark II. 8 for why we choose (fv(e) U UILi f v ( e i)) \ 
{ Xi }f =1 and not (fv(e) \ {x<}? =1 ) U LULiM*)- ( This is 
essentially to inherit the way [1] deals with let recursion.) 

Remark II.4. We inherit Launchbury's assumption in that "all 
bound variables are distinct" [ , §3.1]. As a result, we do not 
need to distinguish between let {y\ = ei,j/2 = £2} in Xx.e 
and let {yi = ei} in {^2 = £2} in Xx.e. We therefore choose 
the former expression as a syntactic shorthand for the latter. 
An interesting result of this is that the latter (with the nested 
let-bindings) is then a value in our system. 

Notation II.5. Write e[y/x] for the usual capture-avoiding 
substitution of x by y in e. For example, (Xy.x)[y/x\ = Xy' .y, 
when x, y, and y' are distinct variables. Moreover, for 
e' = let {xi=ei}" =1 in e, when y ^ {^i}™ = i U /w(e'), define 
e'[x/y] = let {x, = ej[x/y]}™ =1 in e[x/y\. Use the standard 
renaming for capture avoiding when y G {xi}f =1 U fv(e'). 

Definition II.6. For a partial function f, define dom(f) = 
{x I f(x) defined}. Call a partial function T mapping variable 
symbols to expressions and such that dom(T) is finite, a heap. 
r, A, 0, and 3 will range over heaps. When x G' dom(T), 
define (T, x 1— > e) such that 

• (r,x 1— ► e)(x) = e, and 

• (r, x 1— ► e)(y) = T(y) when y =/= x. 
Besides, 

. (T,Xi t-y ei)\ =1 = (r,xi h-> ei), and 



(T, Xi » eOLi = ((T, Xi » eO?^ 1 



e n ). 



Definition II.7. Define a big-step operational semantics 

r : e JJ. A : v as shown in Fig. 1 where: 

• In (var x ), by Definition II.6, we assume x^dom(T). 

• In (let), 'xi fresh' means Xi ^ dom(T) and xt ^ 
fv(T(x)) for x G dom(Y), and similarly for A. ' 

Intuitively, r : e 4J. A : v is: 'Trying to evaluate expression 
e in heap T will result in value v whilst the (probably 
manipulated) bindings are stored in A'. 

Remark II.8. In line with Launchbury's convention 
on distinct variable naming, for any expression e = 
let \xi=eiYl =1 in e' to be evaluated, all occurrences of Xj's 
in e address the local let-bound Xj's. Especially, so are the 
free occurrences of Xj's which are therefore needed to be dis- 
tinguished from free variables of e. A particularly interesting 
effect that this has in our operational semantics is how we 
define free variables for let expressions (Definition II. 3). 

The first difference between our operational semantics and 
that of Launchbury [ ] is that we replace their (Lambda) rule 
with our (val) one. This is a reflection of us extending their 
notion of values with the let-surrounded abstractions. The 
second impact of this extension is in our rule for application: 
The returned value of the left branch above the line in (app) 
might be a let-surrounded abstraction. 

We 'garbage-collect' 2 the bindings x, 1— > e\ in the final 
heap (the A) in (let), unlike [ ,.]. (Note that e\ might be 
the same as e^ or not.) This design choice brings us benefits. 
Thanks to 'garbage-collection' that, for instance, let-bindings 
are not propagated 'outside their scope' to the final heap and 
we do not have to reason explicitly 'up to' these choices thus. 
(Examples of results which fail without a garbage-collecting 
(let) rule are Theorem 8.4 and Corollary 8.6 in [4].) 

The rule for let in [ ,9] does not garbage collect. For them, 
heaps are left with extra bindings after evaluation of let- 
expressions. The semantics in [1,9] allows variables to escape 
their scope during evaluation, which is forbidden in our seman- 
tics. For example, in [I], evaluation of let x = Xy.y in Xz.(xz) 
finishes by adding x 1— > Xy.y to the original heap; this could 
then 'accidentally' bind x occurring elsewhere in the next 
expressions to evaluate. Launchbury is well aware of this issue 
and comments on it [ , §3.1]. To avoid 'accidental name- 
clash' in evaluations in his system, Launchbury imposes a 
pre-evaluation normalisation for expressions which renames 
all the variables already bound in heap. This is fine, if we 
just want to evaluate a particular expression, normalised, in a 
particular heap. However, for reasoning about the evaluation 
of classes of programs and proving operational equivalences 
between them, a garbage-collecting rule for let is better. (See 
[ , §11.3] and [ , §6.3] for more.) 

Garbage-collecting (let) rules were first introduced in [ ] 
and [ ]. The difference between our (let) and that of theirs 

'Consistent with [ ] and subsequent work, we allow the possibility that 
Xi G M e j) or x i G M e j) for 1 < h3 < n - 

2 Whilst some might not consider this the best naming, Launchbury [ , §6.2] 
is the first to use 'garbage-collection' in this specific meaning. 



(val) 



T : eJJ. A : v 



(var x ) 



r : ei JJ. : v x 9 : e 2 J| A : v 2 



^■vii-T-.v (T,x^e) : x i}, (A, x^v) : v ' T : e x seq e 2 J) A : w 2 

r : e 4 B : let {xi=ei}? =1 in Ay.e' 9 : (let {xi=ei}™ =1 in e')[x/y] JJ- A : w n>0 



(seq) 



r : ex JJ. A : u 
(r, Xji— >ej)™ =1 :e JJ. (A, x,i— >e^)™ =1 :i> when Xj fresh, 1 < i < n 

T : let {xi=ei}" =1 in e JJ. A : let {x i =e' i }f =1 in v 



(app) 



(let) 



Fig. 1. Our Operational Semantics 



is that they do not return let-bindings as a part of the 
value, whereas we do. It happens that this gives us more 
freedom than them: In their (let) rule, in addition to our 
freshness conditions, they force x, ^ fv(v) too. This is to 
avoid future referral to garbage-collected variables. For the 
operational semantics of [3] and [4], any such referral would 
halt the evaluation because the garbage-collected variables are 
no longer accessible. This is not the case in our system because 
the garbage-collected variables — although no longer bound 
in the heaps — are still available as the let-bindings in the 
returned expression. Therefore, we do not need Xj ^ fv(v). 
Refer to [5, Chapter 6] to see how the following lazy program 
thus reduces in our system but not in that of [3] and [4]: 
let x' = Xx.x in (let {x x = Ax.x,x 2 = Ax.xi} in x 2 ) x' . 

Our rules are applicable based on a best-fit strategy. 
Therefore, the final rule in T : let {xj=ej}" =1 in v JJ. T : 
let {xj=ej}™ =1 in v is always (val) as opposed to (let). In 
retrospect, in T : let {xj=ej}™ =1 in e JJ. A : v, the final rule is 
(let). 

[3] explains the intuition of e x seq e 2 as "evaluate e\\ throw 
the result away ... but keep the heap and use it to evaluate e 2 ". 
Our (seq) rule implements this operational behaviour, and is 
similar to the (StrictLet) rule of CLEAN [2,10]. 

Notation II.9. A derivation is the labelled tree — labelled 
with terms, heaps, and derivation rules from Definition II. 7 — 
that justifies a reduction T : e JJ. A : v. II, Eli, II', Tl x , . . . will 
range over derivations. Write T : e JJ, A : v' as shorthand 
for "T : e JJ. A : v is derivable". Write T : e JJ-n A : v' 
as shorthand for "II is a derivation of T : e JJ. A : v". "_" 
will represent the wildcard in our notation, so that, by T : 
e JJ. A : _', we are clarifying our lack of interest in the final 
value obtained after evaluating e in T. Write T : e JJ-n ' for 
T : e JJ-n _ : _'• For a derivation T : _ JJ., call T the original 
heap. For derivations Yl x and II 2 , notation II 2 C III indicates 
that _ : _ JJ-rii contains _ : _ JJ-n 2 - Due to space restrictions, 
rules of Definition II. 7 are illustrated in a 'turnstile' fashion 
hereafter. For example, our (seq) rule will become 

(r :ei J|9: Wl ,9:e 2 J|A:« 2 )p^ 

r : ei seq e 2 JJ. A : v 2 . 

III. Heaps and Derivations 

In this section we first consider a few fundamental prop- 
erties of our system which we use later. Next, we provide 



some results about the interplay between a heap T and the 
derivations II in which T is the original heap. 

A. Fundamental Properties 

The particularly important result of this section is Determin- 
ism (Theorem III. 2). A variation of Theorem III. 2 is correct for 
Launchbury's system [ ], as well as that of van Eekelen and 
de Mol [2,9,11]. The rest of results in this section, although 
not as important as Determinism, will be handy later. 

Notation III.l. Write II =| e t a II' when II and II' are equal 
up to renaming \et-bound variables. 

The following theorem shows that derivations are unique up 
to an easy renaming, and reduction is deterministic: 

Theorem III.2. // T : e JJ. n A : v and T : e JJ. n ' A' : v' then 
II =iet« n', A = A', and v = v' . 

Lemma III.3. Suppose that T(x) is a value v x and T : e JJ-n 
A : v. Then, for every heap 3 in II, H(x) = v x . In particular, 

A(x) = v x . 

Lemma III.4. IfT(x) = v then V : x JJ. T : v. 

B. The Interplay 

In this section, we divide (var x ) instances into two groups: 
Trivial and Non-Trivial (Definition III.5). We then formalise 
our intuition that trivial instances do not change the heap 
(Lemma III. 7) whilst non-trivial ones do (Lemma III. 8). These 
results are then used to prove that each derivation contains at 
most one non-trivial (var x ) instance (Theorem III. 9). 

Definition III.5. Suppose T : e JJ. n A : v. Write V(U) for 
the set of x £ dom(T) such that II contains an instance of 
(var x ). Suppose II contains an instance of (var x ) like 

I (var x ) 

T : e x JJ-n' T : v x \ 

(T',x t-y e x ) : x JJ. (T',x i-> v x ) : v x . 

Call the instance trivial when II' consists of a single instance 
of (val) (equivalently, when e x = v x ). Call the instance non- 
trivial otherwise. 

Definition III.6. Suppose T : e JJ-n A : v. Define dijf(U) = 
{x e dom(T) | Y(x) ± A(x)}. 

Lemma III.7. Suppose T : e JJ-n A : v and x G V'(II). Then, 
x $ dijf(Tl) iffT(x) is a value. 



Lemma III.8. Suppose Y : e JJ- n A : v. Then: x G diff (II) O 
r(x) is not a value and A (a?) is a value O II contains a 
non-trivial instance of (var x ) for some x G dom(Y). 

Finally, here is a less obvious result: 

Theorem III.9. Suppose Y : _ JJ-n- For each x G dom(Y), 
there is at most one non-trivial instance of (var x ) in II. 

Proof: Rule-based induction and Lemmata III. 7 and III. 8. 



C. Essential Parts of a Heap for a Derivation 

Fhe mission of this section is to provide criteria for distin- 
guishing the essential parts of a heap for a derivation. Fhat is, 
the parts of a heap Y without which a derivation Y : e JJ. A : v 
is not derivable. Theorem III. 14 formalises this. 

Definition 111.10. For an x G dom(Y), define Y[x <— > e] as: 

• Y[x <— > e](x) = e 

• Y[x i— > e](y) = Y(y) when (y =/= x). 
Furthermore, 

• T[xi i— > ej]| =1 = T[xi i— > ei], and 

. T[xi i-> ei]™ =1 = (r[xj i-> ej]™^ 1 )^,, i-> e„]. 
Suppose r : e JJ-n A : v. Then, define IL[x i— > e] as f/ze 
labelled tree obtained from II fey replacing every heap in II 
vw'f/z 0[a; i— > e] i/a; G dom{&); otherwise, leave unchanged. 

Lemma III.ll. Suppose Y : e JJ-n A : v and Y(x) is a value 
v x . Then, H[x i— > v x ] = II. 

Lemma 111.12. For any heap Y and variable y such that 
y <£ dom(Y), (T,y i-> e y )[x i-> e x ] = (r[as i-> e^],*/ i-> e y ). 

Lemma 111.13. Suppose Y : e JJ-n A : w w/zere a; G dom(Y) \ 
V(IY). Then for any e x , Y[x\— >-e x ] : e JJ-mz^ej A[xi— >e x ] : w, 
diff(Ii[x^-e x ]) = diff(J[), and V(U[x^e x })= V(U). 

Proof: By inductive transformation of II into II [x i— > e x ]. 

■ 
Theorem III. 14 expresses that if V : e JJ-n A : v, then II 
only depends on the T restricted to F(II): 

Theorem 111.14. Suppose T : e JJ-n A : v and suppose 
r(z) = r'(z) for every z G X^(TT). Then there exist A' and 
IF such that: V : e JJ-n' A' : v, diff (W) = diff (li), V(W) = 
V(IL), and A'(z) = A(z) for every z G V(U). 

IV. Atomic Variables and INMB 

A. Individual Properties 

Now that we have identified trivial and non-trivial (var x ) 
instances, and identified the essential parts of a heap for a 
derivation, we can move on to consider a notion of unit of 
change. Intuitively, these are computations which do cause 
changes in a heap, but cause a minimal change. For this, we 
particularly underpin the variables evaluation of which causes 
such an impact — which we call atomic. Definition IV 1 
makes this formal, and the rest of this section explores the 
characteristic properties of atomic variables. These properties 
will then be used in §IV-B to prove our main results. 



Definition IV. 1. Call x atomic in T when there exist A x , v x , 
and Tl x such that F : x JJ-n^ A x : v x and diff(Jl x ) = {x}. 
Write atomic(T) for the set of atomic variable symbols in T. 

So, x is atomic in Y when it can be computed without 
affecting the rest of the heap. Therefore, as Corollary IV.3 will 
prove, if Y(x) is a value then x is not atomic in Y. An impor- 
tant observation is that although the evaluation of an atomic 
variable does not affect the rest of the heap, other bindings of 
the heaps may well be required for its evaluation. For example, 
for r = {x\ i— > (Xx.x) X2, 12 ^ Xx.x}, the variable x\ is 
atomic in Y. This is because for Y : x\ JJ-n _ : Xx.x, the only 
affected binding is that of x\ itself, i.e., diff(IY) = {xi}. This 
is despite the fact that x 2 G V(II) too. 

It turns out that the notion of atomic variables is extremely 
useful in the proofs to follow. Note that a variable is said to 
be atomic with respect to a heap, as opposed to a derivation. 

We will prefer the following slightly more succinct charac- 
terisation of atomicity: 

Lemma IV.2. x G atomic(Y) iff there exist v x and H x such 
that Y : x JJ-n,. Y[x i— > v x ] : v x and dijf(H x ) = {x}. 

Corollary IV.3. Suppose Y(x) is a value. Then, x ^ 
atomic (Y). 

Proof: Take Y(x) = v x for some value v x . By 
Lemma III.4, Y : x JJ. n T : v x . Therefore, diff {XV) = 0. In 
particular, diff (LI) ^ {x}. By Lemma IV.2, x ^ atomic(Y). 

m 

Lemma IV.4 states that atomicity does not change upon 
evaluation when the variable is not touched. On the other hand, 
Lemma IV.5 explains when, from atomicity of a variable in 
one heap, we can deduce its atomicity in another heap. 

Lemma IV.4. Suppose x G atomic(Y) and Y : x JJ- _ : v x . 
Suppose also Y : e JJ-n A : v, and x $ F(II). Then, x G 
atomic(A), and A(x) = v x . 

Lemma IV.5. Suppose Y : x ij-u x and Y(z) = Y'(z) for every 
z G V(Yl x ). Then, x G atomic(Y) implies x G atomic(Y'). 

Proof: Using Theorem III. 14. ■ 

B. Atomic Variables and Derivations 

This section provides three pivotal results which validate 
INMB, a.k.a., induction on the size of diff(YY) for our op- 
erational semantics: Theorems IV.9 and IV 10, respectively, 
explain how to remove and restore an atomic variable from/in a 
derivation to obtain another (valid) derivation. Theorem IV 1 1 
proves that any derivation either manipulates no bindings, or, 
of the manipulated bindings at least one binds an atomic 
variable in the original heap. (See Notation II. 9 for the 
definition of original heap.) We start by providing a new 
way of transforming derivations with special care on atomic 
variables (Definition IV.6). On paying special attention to 
atomic variables, we are following [ ] rather than [3,4]. 

Definition IV.6. Suppose Y : e JJ-n A : v such that x G 
F(II). Suppose also that x G atomic(Y), so that in particular 



r : x JJ-n^ &-x '■ v x far some TL X , A x , and v x . We define 
a labelled tree IL([x i— > v x ]j by inductively transforming II 
based on its final rule. (Following our convention, we do not 
distinguish between T[x i— > v x ] : e JJ-ntfxi-m.,} A[x i— > v x ] : v 
and II([x i— > v x ]j when appropriate.) For each subcase, 

• when x ^ dom(T) or x £ V(II) \ afomjc(r), define 

nfi ^ d,) = n. (iv. i) 

• when x ^ ^(n), define 

n([x ^ v x } = U[x i-> v x }. (IV.2) 

The subcases where x £ atomic(T)n V(JV) are defined below: 

• (var x ). II takes the form 

r : e x j|_ r' 



I (var x 



Define II {[x 

(P^r^r 



(IV' 

Ua.]) to fee 



e x ) : x J| (r',a; i-> u K ) : u x . 



: Wx) 
(I',x 



(var x ) 



«x) : x JJ. (r',x i-> v x ) : v x . 



(let) anii (var y ) for y other than x. II takes the form 



T : e' J|n' A' : v' \^— T : e 4J. A 



vWiere (r) £ {(var y ), (let)}. Define II < 



a: l 



r'f 



s ' JJ-n'([i 



«x]) to be 

(r) 



«.| 



A'[x i— > u x ] : 1/ 

»J:4 A[ii-> u 2 



r[x i— > w^j : e 4 A [a; i— ► w^J : w. 
• (app) and (seq). II takes the form 

(r : ei 4 Hl © : «i, © : e 2 fe 2 A : v 2 ) ^-T : e JJ. A : v 
where (r) £ {(app), (seq)}. Define H([x i— > w^]} to fee 
(r[x i-> wa,] : ei JJ-n^a;^^]) B[x i-> u s ] : v t , 

Q[x i-> u-b] : e 2 JJ-n 2 ([a;^i,^ A[sm u s ] : w 2 ) 

i(r) 

H— ^ T[x i— > v x ] : e JJ. A[x i— > u^] : v. 
Note that (val) can never be the final rule when x £ 
atomic (r)(~l V"(II) because V(JV) = ('n f/zw case. Therefore, 
the recipe for this case is (IV.2) where IL([x i— > v x 
v x ] = II especially. (See Definition III. 10.) 



= Ulxi 



Remark IV.7. The following points about the (var x ) case in 
Definition IV.6 are worth considering: 

• Recall from Theorem III. 9 that there will be at most one 
non-trivial instance of (var x ) in II. In fact, the only 
difference between Definition IV.6 and Definition III. 10 
is that the former replaces the unique non-trivial instance 
of (var x ) for an atomic variable x with a trivial one, 
whilst the latter does not. 

• Suppose r : x JJ. and T : e JJ-n where V : x JJ-rp and II' C 
II. (See Notation II. 9.) The fact that r : x JJ. guarantees 
that x does indeed reduce in V . Furthermore, as seen in 
Lemma IV.5, if (val) is not the only rule used in II' , 
then x is atomic in V too. 

• Because x £ atomic(T), by Lemma IV.2, for any 
instance of (var x ) in II like (r", x i— > _) : x JJ. (A', x i— > 
_) : v x , we are guaranteed that T' = A'. Note that this 
includes both trivial and non-trivial instances of (var x ). 



Lemma IV.8. Suppose T : e JJ. n A : v and x £ V^II) n 
atomic (T). For any sub-derivation II' of II which contains no 
non-trivial instance of (var x ), II' ([x i— > v x ]) = TL'[x i— > v x ]. 

Proof: Suppose V : _ J|n'- There are two cases: 

• x ^ ^(II'). The result is correct by construction. (See 
(IV.2).) 

• x £ V(n'). By construction (Definition III.6), the fact 
that II' contains no non-trivial instances of (var x ), means 
that x £ diff(W). Therefore, 

- on one hand, by Lemma IV.2, x ^ atomic (V). 
Hence, by construction II' ([x i— > u x ]) = II'. (See 
(IV. 1).) 

- on the other hand, by Lemma III.7, T'(x) is a value 
like v x . Hence, by Lemma III. 11, II' [x i— > v x ] = II'. 



Theorem IV.9. Suppose T : e 4J-n A : v and x £ V^(n). Sup- 
pose x £ atomic(T); so in particular T : x JJ.n x T[x i— > v x ] : 
v x for some v x . Then A(x) = v x andT[x i— > v x ] : e i^uBx^v-A 
A : v. Furthermore, diff (II([x i— > v x \) = diff (II) \ {x} and 
V(IL([x ^ v x }) U V(U X ) = V(U). 

Proof: Induction on II based on its final rule: 

• (var x ). II takes the form 

I (var x ) 

r':e4_A':o \^lL 

(r', x i— ► e) : x JJ. (A', x i— > v) : v 
where T = (T',x i-> e) and A = (A',x i-> v). (Note 
that, as also notified in Remark IV.7, A' = T' here.) By 
construction (Definition IV.6), in this case, II ([x i— ► v x } 
will take the form 

, I (val) I (var x ) 

(f^T':v x ^T':v x )f^ i - 

(r', x i->- v x ) : x JJ. (T', x t-y v x ) : v x . 
Observe that, by Determinism (Theorem III. 2), II =| etQ 
II X . (See Notation III. 1 for =i e t Q -) Thus, x £ atomic(T) 
implies diff (II) = diff(U x ) = {x} by Lemma IV.2, and 

= dtff(U{[x ^ v x }) = diff (n) \ {x} 
V(IL(lx h-» v x }) U V(U X ) = {x} U V(U) = V(U). 

• (app). II takes the form 

(r : e' J| Hl © : let { Xi =ei}f =1 in Xz.e", 

Q : (let{x i=ei }™ =1 me")[y/z] J|n 2 A : v) 

I (app) 



T : e' z JJ. A 



v 



where n > 0. There are now two cases: 
- x £ F(IIi). By inductive hypothesis 0(x) = v x 
and T[x i-> v x ] : e! Jj-ni^an-.^} © : Az.e". 
By Lemma III.3, A(x) = 0(x) = v x . On the 
other hand, because 0(x) = v x , by Corollary IV.3, 
x ^ atmoic(Il2). Therefore, if x £ ^(n^), then 
Il2([x i—*- v x } = II2 by construction. (See (IV 1).) 
If x ^ V(U 2 ), then II([x i-> w^]} = II[x i-*- « x ] 
by construction. (See (IV.2).) However, given that 
0(x) = v x , by Lemma III. 11, n 2 [x 1— > v x ] = n 2 . 
Hence, in both cases, by the (app) case of Defini- 
tion IV.6, we can combine III ([x 1— > v x l with II 2 



to get II([x i— > v x ]/ and observe that F[x i— > v x ] : 
e ' V fyii6xi-M) m 1> A : v. The result follows. 
x <£ V(n a ). It follows that x G V(n 2 ). By 
Lemma III. 13, Y[x i— > u K ] : e' JJ-rMxi-Hj.J ©[ x l—> 
w K ] : let {xj=ej}" =1 in Xz.e". Moreover, by con- 
struction, ILi[x <— > uJ = III (fa; l— *• u x])- (See 



(IV.2).) Therefore, T[x 



l-'.r 



a- 



n^x^t.^ 



0[x i— > uj : let {xj=ej}™ =1 in Xz.e". On the 
other hand, by Lemma IV.4, x G atomic(Q), and 
: x JJ. 0a; : ^ for some 0^. By induc- 
tive hypothesis A(x) = v x and 0[x i— > v x ] : 
(\et{xi=ei}f =1 \ne")[y/z} JJn 2 ^^ A : v. We 
combine IIi(|x i— > v x J/ with IT.2(|x i— > u x ]) to 
get II ([x i— > v x } and observe that r[x i— > v x ] : 
e ' V JJ-ndxi-*^]) A : v. (Note that, by the (app) 
case of Definition IV.6, this combination is indeed 
II ([x i— > v x }.) The result follows. 

• (let). II takes the form 

I ( let ) 
(r, xi «-► ei)™ =1 : e JJ. n < (A, x t «-> e£)" =1 : w |— — 

r : let {xi=ej™ =1 in e JJ. A : let {x i =e' i }V' =1 in w. 

Observe that, by Lemma IV.5, x G atomic(T) implies 

x G atomic((T,Xi i— > ej)" =1 ). Therefore, by inductive 

hypothesis: 

- (r, Xj i-> ej)™ =1 [x i-> vj : e JJ-n'flxi-^]) (A,x 4 i-> 
e 'i)r=i : w which, by Lemma III. 12, can be rewritten 
as (T[x i-> w x ],Xi >-> ei)™ =1 : e JJ-n'^-^} 
(A,Xj i— > e'j)™ =1 : v. Extending with (let), we 
deduce T[x i-> v x ] : let {xj=ej}™ =1 in e JJ-nflsi-M;,,) 
A : let {xj=e'j}™ =1 inw. By the (let) case of Defini- 
tion IV.6, we know that this combination is indeed 
II([x i— > v x }, as desired. 

- (A, Xi i— > e'j)" =1 (x) = v x which, given that Xj's are 
fresh, implies A(x) = v x . 

Other cases are no harder. ■ 

Theorem IV.10. Suppose x G atomic(T) and T : x JJ _ : v x . 
Suppose T([x i— > v x ]j : e JJ-n' A : v and x G V(II'). Then 
r : e J|n A : v such that II ([x i— > v x ]j = II'. 

Proof: Similar to Theorem IV.9 except that, this time, we 
inductively transform II' to II. ■ 

Theorem IV. 11. Suppose T : e 4J-n A : v. Then exactly one 
of the following possibilities holds: (1) diff(lV) = 0. Or, (2) 
there exists an x G diff (II) such that x G atomic{Y). 

Proof: By induction on II based on the final rule: 

• (val). II takes the form 

I (val) 

P^T :v ii.T: v 



in which case diff (II) = 0. 
(let). II takes the form 

(r, Xi i-> ei)™ =1 : e JJ-rp (A, Xi 



4)?=i : « 



I (let) 



T : let {xi=ei}? =1 in e JJ A : let {x i =e' i Yi=i in «. 
If tKff(n') = 0, then (r,x; .-> ei )" =1 (y) = (A, a* h-» 
e 'i)?=i(y) f° r every y G dom((T,Xi i— > ej)™ =1 ). Because 



XjS are fresh (Definition II. 7), for every z G dom(r), this 
implies T(z) = A(z). Hence, diff (II) = too. 
Suppose diff(U') ^ 0. Then, by inductive hypothesis, 
there exists a variable x which is atomic in (r, Xi i— > 
e,)™ =1 . If x G dom(r), then, by Lemma IV.5, x is also 
atomic in T, as desired. If every such x is in {x!, . . . , x„}, 
then dijff (II) = and the result follows. 
Other cases are no harder. ■ 

V The New Inductive Principle and Uses 

Induction on the number of the manipulated bindings 
(Inmb) was first introduced in [ ]. Later on, [4] completed 
their proofs and showed the validity of this new inductive 
principle for their restricted operational semantics. Finally, 
[5] proved the correctness of this inductive principle for a 
non-restricted operational semantics for lazy evaluation with 
selective strictness. Neither [ ] nor its two succors [4,5] 
however outlined a mathematical formulation of induction on 
the number of manipulated bindings. Here is our formulation 
of this principle: 

[P(IL);diff(IL) = 0] 
A 
[Vk.(P(IL)-\\dtff(IL)\\ = k)^ 

(P(n);||rfiff(n)|| = fc + i)] 

(Inmb) 

Vn.P(II);|]dijf(n)]] = n 

In words, (Inmb) states that: "P(II) holds if: (1) P(II) 
holds when no binding gets manipulated over derivation II. 
And, (2) validity of P(II) when there are k manipulated 
bindings in derivation II implies validity of P(II) when there 
are k + 1 manipulated bindings." 

As an example for Inmb, we next prove Theorem V.4 using 
this technique. All the preliminary results and definitions are 
from [ ] so we drop the proofs except for Theorem V.4. We 
start by introducing a notion of equivalence between heaps: 

Definition V.l. Define l\ w r 2 by: 



Ve.V«.(pAi.ri :4Ai:«)» (3A 2 T 2 : e ij. A 2 : o)J. 
Lemma V.2. pa is an equivalence relation. 

Lemma V.3. Suppose x G atomic(Y), so in particular T : 

x JJ. r[x i— > v x ] : v x for some v x . Then T ra T[x i— > v x ]. 

Theorem V.4. // T : e JJ n A : v then Y » A. 

Proof: INMB. There are two cases: 

• diff(U) = 0. The result follows by reflexivity of pa 
(Lemma V.2). 

• diff(U) ^ 0. By Theorem IV.ll, there exists an x G 
diff (II) such that x G atomic (T), so in particular T : x JJ. 
T[x i— > v x ] : v x for some v x . By Lemma V.3, V pa T[x >—> 
v x ]. By Theorem IV.9, T[x i— > v x ] : e i^nSxt-yvJ) A : v 
(Definition IV.6). By Theorem IV.9, diff(IL[x ^ v x \) = 
diff (U) \ {x}. By inductive hypothesis, T[x i— > v x ] pa A. 
We use transitivity of pa (Lemma V.2). 



Using a more complicated version of our (INMB), [3,4] 
prove a number of observational equivalences between lazy 
programs in presence of selective strictness. For demonstra- 
tion, we only present two types of observational equivalences 
they define: 

Definition V.5. Define e.\ ~^ e 2 by: 

Vr, A. (3 Vl . r : ei 4 A : Vl ) O (3v 2 . T : e 2 4 A : v 2 ). 

Intuitively, e\ ~h e 2 when e\ and e 2 compute the same 
final heaps, given the same initial heap; ~^ does not examine 
the final values, which may differ. 

Definition V.6. Define e.\ ~ s e 2 by: 

Vr, v, A. (r : ei 4| A : v) O (r : e 2 J| A : w). 

Intuitively, e x E~ s e 2 when, given the same initial heap, e^ 
and e 2 compute the same final value and final heap. 

Two interesting examples proved in [3] and [ ] using INMB 
are then: 

1) Vei,e 2 . e 1 seq e 2 » h e 2 seq e a . 

2) Vex, e 2 , e 3 . e a seq e 2 seq e 2 « s e 2 seq e x seq e 3 

VI. Related Work 

Ong and Abramsky [ 1 2]— [14] studied call-by-need, but they 
did not capture sharing. Both [15] and [1] present a big-step 
semantics for call-by-need. Ariola et al. [8,16,17] considered 
call-by-need (as a calculus) rather than lazy evaluation, but 
did not capture recursive let-bindings. We choose to build 
on Launchbury's big step semantics [ ] which models lazy 
evaluation at the level of abstraction suitable for big-step 
observational equivalence. 

Earlier work addresses issues raised by selective strictness. 
Harrison et al. [ ] provides a calculational approach, in 
which every detail of how to implement a HASKELL inter- 
preter in HASKELL itself is explained. Later, [19] uses P- 
logic [ ] to specify and verify properties of programs with 
selective strictness. In [ ], van Eekelen and de Mol extend 
Launchbury's semantics to selective strictness. Other related 
works of this group are [9] and [ "]. We choose to directly 
reason on derivation trees of an operational semantics which 
is a variation of that of [2]. Johann and Voigtlander [ ] study 
free theorems in presence of selective strictness. [22]-[26] all 
extend this work but for call-by-name rather than call-by-need. 
Finally, Hidalgo-Herrero [27] offers a denotational semantics 
in which they can prove similar identities to the ones proved 
in [3] and [4]. 

The study of observational equivalence between lazy pro- 
grams in presence of selective strictness starts in Gabbay et 
al. [3]. Haeri [4] fills all the missing details in [ ]. Both the 
operational semantics of [3] and that of [ ] however have a 
restricted expressiveness. Whilst retaining all the interesting 
features of [ ] and [ ], Haeri [5] solved the expressiveness 
problem. Haeri [5] accomplishes this by carrying the let- 
bindings around with the evaluated expression rather than 
retaining them in the returned heap. Finally, Haeri [6] pro- 
vides yet another operational semantics based on that of van 



Eekelen and de Mol [ ]. Haeri proves equivalences between 
the operational semantics in [5] and [6]. His latter operational 
semantics is both simpler and proved to be as expressive as 
the latter operational semantics. 

An improvement in [ ] and [ ] (and thus this paper) over 
[3] and [4] is that the former group does not force the side 
condition x, ^ fa(v) for (let). Hence, a group of arguably 
reasonable lazy programs which do not reduce in [3] and [4] 
do reduce in our system. As explained in [ , §6.2], let y = 
Xx.x in Xx.y is an example of this group where y E fv(Xx.y). 

The motivation of [3] and [4] for adding the Xi ^ fv(v) 
side condition (Fig. 2) is to prevent the reduction of certain 
expressions from midway failure. Xi ^ fv(v) aims this 
through illegalising the expressions their lack of bindings is 
expectable over the next evaluation stages. As explained in 
[5, §6.1], an example of such expressions is e = let x' = 
Xx.x in (let {xi = Xx.x,x 2 = Xx.xi} in x 2 ) x' . In their 
system, without the Xi £ fv(v) side condition, the reduction 
of e would halt once it reaches the evaluation of x\. This is 
because, at that point, x\ is no longer bound in the respective 
heap. Lazy programs like e which do not reduce in [ ] and 
[4] do reduce in our system. 

Despite these differences, it is easy to prove the following 
result. Note that, the inverse implication is not correct. 

Theorem VI.l. Suppose V : e 4| s A : Xx.e'. Then, T : e J| 
A : let {xj=e;}™ =1 in Xx.e' for some {xi}" =1 where n > 0. 

As proved in [6], our system is as expressive as that of van 
Eekelen and de Mol [2]: 

Theorem VI.2. Suppose T : e ty vd (A,x t >-> ei)™ =1 : 
Xx.e' such that dom(T) = dom(A). Then, T : e 4J. A : 
let {xj=ej}™ =1 in Xx.e' . 

VII. Conclusion and Future Work 

In this paper, we extended Launchbury's model for lazy 
evaluation: In §11, we follow Ariola and Felleisen by including 
let-surrounded abstractions in values. Consequently, we end 
up having a new application rules. Besides, our (let) rule, 
in the same time that performs garbage-collection in the 
relevant heap, retains the (probably manipulated) let-bindings 
by returning let-surrounded values. As explained in §VI, our 
special form of garbage-collection, retains enough information 
for later reference. Consequently, we dismiss the restrictive 
side condition in (let ) which prevented [ ] and [ ] from 
fully modelling lazy evaluation. As a result, we accomplish 
full modelling of lazy evaluation in the same time that our 
operational semantics avoids heap growth of expressiveness 
upon evaluation of let-expressions. Finally, our operational 
semantics borrows a variation of van Eekelen and de Mol's 
(StrictLet) [2,11] for selective strictness. Our operational 
semantics is as expressive as that of van Eekelen and de Mol. 

Different interplays between heaps and derivations are dis- 
cussed in §111. Atomic variables and their individual properties 
are investigated in §IV-A. The most important part of this 
paper is §IV-B where the pivotal role atomic variables play 



Syntax: e 

Semantics: 



:= x | Xx.e | ex | let {x i =e i \™ =1 in e | e seq e v ::= Xx.e 

(lam) T:e^ Q:\y.e' & : e'[x/y] i^ G A : v 



r : Xx.e JJ. r : Ax.e 



T-.ex^A-.v 



(app) 



(r,Xi^ ei )^ =1 :eJJ. s (A.Xi^e^)^^ ( Xi gfv(v), Xi fresh) 
T: let{xi= ei }™ =1 ine^ e A : v 



(let ) 



Fig. 2. Differing Parts of the Syntax and Operational Semantics of [ ] and [ ] 



in derivations is studied. Proofs of §IV-B validate induction 
on the number of manipulated bindings (INMB) for our opera- 
tional semantics. INMB is outlined in §V with a few interesting 
uses of it. Related work is finally reviewed in §VI. 

Various routes can be taken for future work: Extending 
our work to parallel lazy languages like GpH [28] is one 
example. The study of observational approximation for certain 
conditions when the change in heap expressiveness is permis- 
sible is another option. For applications like [ , ], in proving 
observational equivalence between lazy programs, heaps very 
much play the customary role of Contexts. However, one might 
still consider investigating more powerful notions capable 
enough to cope with customary contexts too. For example, 
extending current technology not to distinguish between the 
lazy behaviour of Xx.x+ 1 and Xx.l + x is desirable. Finally, 
investigating the validity of a context lemma [29] for the ~ s 
of [3,4] is also worth considering. 
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